Pseudo 2-dimensional Hidden Markov Models in Speech Recognition

نویسندگان

  • Steffen Werner
  • Gerhard Rigoll
چکیده

In this paper, the usage of pseudo 2-dimensional Hidden Markov Models for speech recognition is discussed. This image processing method should better model the timefrequency structure in speech signals. The method calculates the emission probability of a standard HMM by embedded HMMs for each state. If a temporal sequence of spectral vectors is imagined as a spectrogram, this leads to a 2-dimensional warping of the spectrogram. This additional warping of the frequency axis could be useful for speakerindependent recognition and can be considered to be similar to a vocal tract normalization. The effects of this paradigm are investigated in this paper using the TI-Digits database.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM

Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...

متن کامل

Hidden Markov models merging acoustic and articulatory information to automatic speech recognition

This paper describes a new scheme for robust speech recognition systems where visual information and acoustic features are merged. Using as robust unit the « pseudo-diphone », we compare a global Hidden Markov Model (HMM) and a Master/Slave HMM through a centisecond preprocessing and through a segmental one. We confirm by experimentation the importance of articulatory features in clean and nois...

متن کامل

Hidden Markov Models for Spatio-Temporal Pattern Recognition and Image Segmentation

Time and again hidden Markov models have been demonstrated to be highly effective in one-dimensional pattern recognition and classification problems such as speech recognition. A great deal of attention is now focussed on 2-D and possibly 3-D applications arising from problems encountered in computer vision in domains such as gesture, face, and handwriting recognition. Despite their widespread ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001